Approximation of Grammar-Based Compression via Recompression
نویسنده
چکیده
In this paper we present a simple linear-time algorithm constructing a context-free grammar of size O(g log(N/g)) for the input string, where N is the size of the input string and g the size of the optimal grammar generating this string. The algorithm works for arbitrary size alphabets, but the running time is linear assuming that the alphabet Σ of the input string can be identified with numbers from {1, . . . , N} for some constant c. Otherwise, additional cost of O(n log |Σ|) is needed. Algorithms with such an approximation guarantee and running time are known, the novelty of this paper is a particular simplicity of the algorithm as well as the analysis of the algorithm, which uses a general technique of recompression recently introduced by the author. Furthermore, contrary to the previous results, this work does not use the LZ representation of the input string in the construction, nor in the analysis.
منابع مشابه
Approximation of smallest linear tree grammar
A simple linear-time algorithm for constructing a linear context-free tree grammar of size O(rg + rg log(n/rg)) for a given input tree T of size n is presented, where g is the size of a minimal linear context-free tree grammar for T , and r is the maximal rank of symbols in T (which is a constant in many applications). This is the first example of a grammar-based tree compression algorithm with...
متن کاملA Space-Saving Approximation Algorithm for Grammar-Based Compression
A space-efficient approximation algorithm for the grammar-based compression problem, which requests for a given string to find a smallest context-free grammar deriving the string, is presented. For the input length n and an optimum CFG size g, the algorithm consumes only O(g log g) space and O(n log∗n) time to achieve O((log∗n) logn) approximation ratio to the optimum compression, where log∗n i...
متن کاملBone cement distribution in the vertebral body affects chances of recompression after percutaneous vertebroplasty treatment in elderly patients with osteoporotic vertebral compression fractures
OBJECTIVE Percutaneous vertebroplasty (PVP) is a surgical procedure that has been widely used to treat patients suffering from osteoporotic vertebral compression fractures (OVCFs). The procedure involves injection of bone cement into a fractured vertebra. In this study, we investigated whether the distribution of the cement in the vertebral body is related to the occurrence of recompression aft...
متن کاملApplication of Lempel-Ziv Factorization to the Approximation of Grammar-Based Compression
We present almost linear time (O(n · log |Σ|) time) O(log n)ratio approximation of minimal grammar-based compression of a given string of length n over an alphabet Σ and O(k · log n) time transformation of LZ77 encoding of size k into a grammar-based encoding of size O(k · log n). Computing exact size of the minimal grammar-based compression is known to be NP -complete. The basic novel tool is ...
متن کاملApproximation algorithms for grammar-based data compression
This thesis considers the smallest grammar problem: find the smallest context-free grammar that generates exactly one given string. We show that this problem is intractable, and so our objective is to find approximation algorithms. This simple question is connected to many areas of research. Most importantly, there is a link to data compression; instead of storing a long string, one can store a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Theor. Comput. Sci.
دوره 592 شماره
صفحات -
تاریخ انتشار 2013